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Abstract. It is a well-known fact that genetic sequences may contain sections with repeated 
units, called repeats, that differ in length over a population, with a length distribution of geo- 
metric type. A simple class of recombination models with single crossovers is analysed that result 
in equilibrium distributions of this type. Due to the nonlinear and infinite-dimensional nature of 
these models, their analysis requires some nontrivial tools from measure theory and functional 
analysis, which makes them interesting also from a mathematical point of view. In particular, 
they can be viewed as quadratic, hence nonlinear, analogues of Markov chains. 

1. Introduction. Recombination is a by-product of (sexual) reproduction, which leads 
to the mixing of parental genes by exchanging genes (or sequence parts) between ho- 
mologous chromosomes (or DNA strands) . This is achieved through an alignment of the 
corresponding sequences, along with crossover events that lead to a reciprocal exchange 
of the induced segments. In this process, an imperfect alignment may result in sequences 
that differ in length from the parental ones; this is known as unequal crossover (UC). Im- 
perfect alignment is facilitated by the presence of repeated elements (as is observed within 
some rDNA sequences, compare |10j). and is believed to be an important driving mecha- 
nism for the evolution of the corresponding copy number distribution. The perhaps best 
studied case of repeated elements concerns microsatellites, see [9] and references given 
there for a summary. An important observation is that, within a population, the copy 
numbers vary, and often (at least approximately) follow a distribution of geometric type 
(meaning a geometric distribution or a finite convolution product thereof), see [9t[13|,l3] 
and references therein for some experimental examples and findings. 

The microsatellites themselves may follow an evolutionary course independent of each 
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Figure 1: Snapshot after an unequal crossover event as described in the text. Rectangles 
denote the relevant blocks, while the dashed lines indicate possible extensions with other 
elements that are disregarded here. 

other and thus give rise to evolutionary innovation. For a detailed discussion of these 
topics, see [9j [23] and references therein; for a brief introduction to molecular evolution, 
see also [8], or [25] for a thorough overview. In this paper, which is mainly based 
on previous work by Redner [19l [18], we shall focus on the distribution of the copy 
numbers only, and disregard further aspects of the possible evolution of the repeated 
units themselves. We rather aim at analysing some simple models in order to understand 
the observed copy number or repeat distributions. Moreover, we are primarily interested 
in models that preserve the mean copy number, though our setting will be adequate to 
accommodate also more general models. In view of possible applications to systems where 
the copy number (slowly) changes with time, it seems natural to set up a frame that can 
cope with such a situation as well. 

In the entire model class to be described below, one considers individuals whose 
genetic sequences contain a section with repeated units. These may vary in number, 
i € No = {0, 1, 2,3,.. .}, where i — is explicitly allowed and corresponds to no unit 
being present (yet). The composition of these sections (with respect to mutations that 
might have occurred) and the rest of the sequence are ignored here, as are details of the 
actual alignment process (e.g., whether partial loops of longer pieces are formed in order 
not to disturb the alignment outside the repeat region), see also [4] for a first discussion 
of possible models in this direction. 

In the course of time, recombination events take place in which a random pair of 
individuals is formed and their respective sections are randomly aligned, possibly im- 
perfectly with 'overhangs'. Then, both sequences are cut at a common position between 
two building blocks and their right (or left) fragments are interchanged. This so-called 
unequal crossover is schematically depicted in Figure [T] Obviously, the total number of 
relevant units is conserved in each event. While this is clearly a stochastic process, it 
is nevertheless interesting to investigate its deterministic limit, at least as a first step 
towards a better general understanding of this model class. To contribute to this first 
step, and to summarise what has been done in this direction so far, is the main aim of 
this contribution. 

2. Description of the deterministic limit. As a first step for the analysis of crossover 
dynamics, we assume the population size to be (effectively) infinite, i.e., large enough so 
that random fluctuations may be neglected (finite populations will briefly be mentioned 
later on). We write A4(X) for the (finite) measures on a space X, denote the restriction 
to positive measures by a superscript +, and indicate a restriction to measures of total 
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variation r by a corresponding subscript (see |20j or [27j for a short summary of the mea- 
sure theory needed here) . Then, the distribution of the copy numbers over our population 
is described by a probability measure (or vector) p e (No), which we identify with 
an element p = (p fc ) feeNo in the appropriate subset of ^ 1 (No)- Since we do not consider 
any genotype space other than No in this article, reference to it will be omitted in the 
sequel, so we write £ instead of •^ 1 (No) from now on. These spaces are complete in the 
metric derived from the usual £ norm, which is the same as the total variation norm 
here. The metric is denoted by 

d(p,q) = IIP = ^2\Pk~Qk\- C 1 ) 

fe>0 

Let us consider the above process (as well as various more general ones) on the level 
of the induced dynamics on the probability measures (i.e., in the infinite population limit 
mentioned above). With the notation just introduced, the dynamics can be described by 
means of the recombinator 

n ^ )i := H^T Yj T ij,kiPkPe- ( 2 ) 

Here, Tij^t > denotes the probability that a pair (k,£) turns into so, for normal- 

isation, we require 

T ijM = 1, for all k,£e N . (3) 

i,j>0 

The factor p k p e in ^ describes the probability that a pair (k, £) is formed, i.e., we assume 
that two individuals are chosen independently from the population. We assume further 
that, for all i,j,k,£, 

Tij^ki — Tji^i = Tij^k , (4) 

i.e., that Tij^e is symmetric with respect to both index pairs, which is reasonable and 
follows from the corresponding symmetry of the underlying process, compare Figure Q] 
Then, the summation over j in @ represents the breaking-up of the pairs after the recom- 
bination event. These two ingredients (symmetry and summation) lead to the quadratic 
nature of the iteration process, see below for more and O [15] for the appearance of 
similar types of equations in a different class of biological models. 

Condition ([3]) and the presence of the prefactor l/||p||i in the defining Eq. ([2]) make 
TZ norm non-increasing, i.e., ||72.(a;)|| 1 < ||aj||i, and positive homogeneous of degree 1, 
i.e., IZ(ax) — \a\lZ(x), for x £ £ x and a G R. Furthermore, TZ is a positive operator 
with ||7?.(:e)|| j = 1 1 1 1 x for all positive elements x e i 1 . Thus, it is guaranteed that TZ 
maps A4^, the space of positive measures of total variation r, into itself. This subspace 
is complete in the topology induced by the norm || . || x , i.e., by the metric d from ([1]). (For 
r = 1, the prefactor on the right hand side of ((2]) is redundant, but improves numerical 
stability of an iteration with the nonlinear mapping TZ.) 

Given an initial configuration p = p(0), the dynamics may be taken in discrete time 
steps, with subsequent generations, 

p(t + l) = TZ(p(t)), teN . (5) 
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This iteration reflects the following: due to random mating, it is sufficient to consider 
the dynamics at the level of the single strands, which will be combined into pairs again 
randomly in the next generation, according to the Hardy- Weinberg equilibrium [7j. 

Our treatment of this case will be set up in a way that also allows for a generalisation of 
the results to the analogous process in continuous time, where generations are overlapping, 

f t p(t) = g(TZ-l)(p(t)), t>0. (6) 

This reflects what is called instant mixing, i.e., the instantaneous formation of pairs, 
their recombination and separation. In other words, the actual duration of the diplophase 
(or "paired phase") is neglected, which is an approximation that is justified as long as 
recombination is rare on the time scale of an individual life span. 

Obviously, the (positive) parameter g in ([6]) only leads to a rescaling of the time t. 
We therefore choose g = 1 without loss of generality. Furthermore, it is easily verified 
that the fixed points of j5|) are in one-to-one correspondence with the equilibria of J6J. 
(In the sequel, we use the term fixed point for both discrete and continuous dynamics.) 

As mentioned above, our main interest at present is in processes that conserve the 
total copy number in each event, i.e., ki > for i + j = k + £ only. More general 
scenarios are possible, and also interesting, but already the concept of an equilibrium 
gets rather involved, whence we do not go into further details here. Together with the 
normalisation ([3|) and the symmetry condition from above, this yields 

■ t ST i+ 3 r rp k + l k + l 

i,j>0 i,j>0 i,j>0 

the second equality of which is an alternative condition that can replace the strict preser- 
vation of the copy number as follows. 

Lemma 1. Let 1Z be defined by |2]), with Tij^i > subject to the normalisation ([3]) and 
the symmetry conditions If also the second equality in ([7]) is satisfied, for all k,£ G No , 
the mean copy number in the population is preserved. 

Proof. This is a simple calculation, 

i>0 i,j,k,l>0 kj>0 k>0 

which shows the claim, provided that the mean m := ^ ipi is well-defined. ■ 
From now on, we use the symbol m for the mean, in order not to confuse it with 
summation indices and the like. 

3. Markov chains for comparison. Let us take a brief detour to look at the linear 
counterpart, a countable state Markov chain, in the deterministic limit of the forward 
equation for the time evolution of its probability distribution. To this end, consider again 
probability vectors p on No and define 

oo 
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for k £ No, where all M^e > together with X^fclo ^kt = 1 for all I £ No- This also makes 
the above sums well-defined on all elements of I 1 . Note that the matrix M = (Mke)k,eeN 
is transposed in comparison with the standard convention for Markov chains |21j , because 
we are using it here in a dynamical systems context, with action of the matrix to the 
column vector on the right. The time evolution now either reads 

p(t+l) = £DT(p(t)) (in discrete time) (8) 

or 

■4rp = (9Jt— l)(p) (in continuous time), (9) 

where the rate constant is again assumed to be 1, compare the remark after Eq. ©. 

The iteration of the discrete version ||5J) on I 1 is well-defined, while uniqueness of the 
solution of the initial value problem for the continuous time counterpart ([9]) on the same 
space follows from its global Lipschitz property, 

Wm^-miq)^ = mip-q)^ < ]T M M \ Pl -q,\ = £><-fc| =\\p-q\\ x , 

k,i>o e>o 

which holds for all p, q £ I 1 . The properties of the matrix M guarantee that the pos- 
itive cone as well as the simplex of probability vectors are preserved in forward time. 
Consequently, one can consider |[S} and Q as dynamical systems on I 1 . As the latter 
is a Banach space of infinite dimension, the unit ball is no longer compact in the norm 
topology, whence some extra care is needed for the results. 

As before, fixed points of (jHJ line up with equilibria of @, so that we speak of 
fixed points in both cases. Their existence is provided by Perron-Frobenius theory for 
countable state Markov matrices, see P21 Ch. 7.1] or [211 Ch. 5] for a detailed account. 
Irreducibility, aperiodicity and primitivity are defined as in the finite-dimensional case 
without difficulty. However, for meaningful results on eigenvalues and eigenvectors, one 
additionally needs the concept of recurrence, see [12j p. 197 f.] for a nice summary. 

The Perron value A emerges from the radius of convergence, p, of the power series 
T(z) = X)n>o( z -^0™ V ^ a P = V^- Clearly, we have p > 1 for a Markov matrix. If one 
diagonal entry (and then any) of T(z) diverges at 1 (so that p = 1 in this case, compare 
[21] Thm. 6.6]), the countable state Markov matrix M is called recurrent, where the 
behaviour of the diagonal element T{z)a, as z — > 1, reflects the expected number of 
recurrences to i, which is infinite in this case. Moreover, a unique normalised and strictly 
positive (right) eigenvector p £ M± exists with Wl(p) = p, see [211 Thm. 5.4]. This 
probability vector has the meaning of the unique equilibrium distribution and is the 
desired fixed point of the dynamics. 

Assume for a moment, in addition to the above conditions on 971, that 

Y, iM V = 3, for all j. (10) 

i>0 

As before, this is a sufficient condition for the mean to be preserved under the dynamics, 
because one has 

i>0 i>0 j>0 j>0 i>0 j>0 
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with the interchange of summation being permissible due to absolute convergence of the 
sums involved, provided that m = J2j JPj exists. However, a condition of type (fTOf is 
usually too restrictive for a linear system, wherefore we do not impose it here. As we 
shall see, the mean copy number can be preserved without it. 

A probability vector p is called reversible for 971 when the detailed balance equation 

M ktPi = M ikPk (11) 
holds for all k, £ € No- An important consequence is that any reversible p is automatically 
a fixed point of 371: 

vn(p) k = ^2,m uPi = J2M ekPk = p k . 

e>o e>o 

Reversibility often provides a simpler way to actually calculate a specific fixed point than 
the defining matrix eigenvalue equation. 

Since the Perron- Frobenius eigenvalue A need not be isolated in the spectrum of M , 
the convergence properties are more subtle than in the finite-dimensional situation. Under 
certain extra conditions (e.g., if A is isolated), the time evolution of an arbitrary initial 
condition converges exponentially fast towards the fixed point. However, when the matrix 
M is not only recurrent, but also positive recurrent, one has at least convergence of the 
discrete iteration, see [5T] for details. Here, positive recurrence means that the expected 
time for a return to the state i is finite, which is clearly stronger than mere recurrence. 

The standard geometric distribution with parameter a € (0, 1) is a discrete probability 
distribution on No, defined by the probability vector p with 

p n := a(l - a) n , for n £ N . (12) 

Clearly, p n > and J2n>oPn — 1> while m = X)n>o n P" = (1 — a )l a i so that a = 
l/(m + 1). If we define the matrix M — (My)ij>o by My = pi, one has 

(Mp)i = 22M ijPj = Pi2_, P j = Pi, 

3 3 

so that Mp = p. One clearly has M n = M for all n E N. Consequently, each entry of 
T{z) is a geometric series of the form My (1 + z + z 2 + . . .), which thus diverges at z = 1. 
In particular, M is (positive) recurrent. 

The matrix M does not satisfy Eq. (JTOj) . Nevertheless, the mean copy number is 
preserved in the following sense. Let a be an arbitrary probability vector with mean 
m, and p the geometric distribution according to (p~2|) with the same mean. With the 
corresponding matrix M, one then finds 

X! ' = X! X '''' X! m - 

i,i i,j i 3 

which results in the mean preservation, provided one starts with an initial condition a 
of mean m. Otherwise, the iteration maps a to an image of mean m in the first step, and 
preserves m in all subsequent iterations. 

Further eigenvectors of M are given by := e — e e for I £ N, where ej is the 
standard basis vector with 1 in coordinate i and otherwise. All these extra vectors belong 
to the eigenvalue 0, which is the only other eigenvalue of M. In fact, M is diagonalisable, 
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and it is not difficult to see that an arbitrary vector a = (a ,a 1 ,a 2 , ■■■) G t 1 can be 
written as a convergent expansion, a = /3p + X^>i(/^.P£ — a e) c l^^ where (3 = J2i>o a i- 
Consequently, the chosen eigenvectors of M form a basis of i x . If U = (p, q^ 1 ', q^ 2 \ ■ ■ ■) 
denotes the matrix that columnwise consists of the eigenvectors of M, one has 

M = £7diag(l,0,0, ...)U~ 1 , 

which makes the relation M n = M for n S N particularly transparent. Moreover, one 
sees that M commutes with all matrices N of the form N = UAU -1 where A has the 
block form 

'a tx 



A = 



A' 



with an arbitrary matrix A' . Restricting N so that M + N is still Markov, one can find 
multi-parameter families of Markov matrices that share the given stationary geometric 
distribution p. The same stationary probability vector p can thus arise from many other 
Markov chains as well. 

Let us now return to the bilinear counterpart to see which of these structural prop- 
erties possess an analogue, and to describe the setting of our later analysis. 



4. General structure of the bilinear system. Consider the crossover dynamics as 
defined by ([2]). Let us begin by stating the following general fact. 

Proposition 1. If the recombinator 1Z of ([2| satisfies the normalisation conditions (|3]). 
one has the global Lipschitz condition 

\\TZ(x)-n(y)\\ 1 < CWx-yW^ 

with constant C = 3 on I 1 , respectively C = 2 if x, y £ A4 r . 

Proof. Let x, y £ I 1 be non-zero (otherwise the statement is trivial). Then, one has 

x k x l Vk Vl 



n(x)-n(y)\\ 1 = E E T ijM 

i>0 j,k,l>0 
x k x £ Vk Vl T 
i,j>0 



< 



< 



E 

k,l>0 

E 



12/11 1 



x 



W-vt\ + 



k,e>o 

The last term becomes 
1 ii „ , , 



E 

k,e>o 
Vk 



i Ill/Hi 



x k Vl Vk Vi 



X 



X 



\\y\\i 



I*- v 1 + 



Ii lli/lli 
\\\\y\\i x - 



x 



\y\\i x - ll^lliy||a = 



\y\\i-\\x\\ 1 )x+\\x\\ 1 {x-y)\\ 1 <2\\x-y\\ 1 , 



from which ||72.(a;) — 1Z{y)\\ 1 < 3\\x — y\\ x follows for x. y G I 1 . If x, y G A4 r , one has 
ll^lli = llylli an< i the above calculation simplifies to \\lZ(x) — 1Z{y)\\ 1 < 2\\x — y\\ x . m 

In continuous time, this is a sufficient condition for the existence of a unique solution of 
the initial value problem ©, compare [TJ Thms. 7.6 and 10.3]. 
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It is instructive to generalise the notion of reversibility. We call a probability vector 
p G M\ reversible for a recombinator TZ of the form |j2]) if, for all i, j, k, £ > 0, 

T i 3 ,kiPkPe = T ki, t3 PiPj- (I 3 ) 
Though this set of equations for detailed balance is much more restrictive than its lin- 
ear counterpart in Eq. Ijlljl . the relevance of this concept is evident from the following 
property. 

Lemma 2. If p £ is reversible for TZ, it is also a fixed point of TZ. 
Proof. Assume p to be reversible for TZ. Then, by Q, 

n (p)i = T ij,uPkPt = i), ,, />,/>, /', X!''.' = p i> 

j,k,e>o j,k,e>o j>o 

for all i £ No, which shows the claim. ■ 

Returning to the original question of the existence of fixed points, we now recall the 
following facts, compare [6J, [22] for details and proofs, which are needed for some general 
statements in the fixed point discussion. 

Proposition 2. 27, Cor. to Thm. V.1.5] Assume the sequence (p^) in Mf to converge 
in the weak-* topology (i.e., pointwise, or vaguely) to some p £ M^, i.e., 

lim p^ — p k for all k S No , with p k > and J2k>oPk = 1 ■ 

Then, it also converges weakly (in the probabilistic sense) and in total variation, i.e., 
lim^oollpW-pH^O. - 

Recall from [6] that a set of measures M C M.^ is called tight when, for every e > 0, 
there is an m £ No such that J2k>mPk < e ' simultaneously for all p £ M.. This is a 
uniformity condition which serves as a condition for the compactness needed later on. 

Proposition 3. Assume that the recombinator TZ from (|2|) satisfies the normalisation 
(J3j) and possesses a convex, weak-* closed invariant set M. C A4f , i.e., TZ(A4) C M, 
that is tight. Then, TZ has a fixed point in M. . 

Proof. Prohorov's theorem [22j Thm. III. 2.1] states that tightness and relative compact- 
ness in the weak-* topology are equivalent (see also [6] Chs. 1.1 and 1.5]). In our case, M. 
is tight and weak-* closed, therefore, due to Proposition [2l norm compact. Further, M. 
is convex by assumption, and TZ is (norm) continuous by Proposition [T] Thus, the claim 
follows from the Leray-Schauder-Tychonov fixed point theorem [20J Thm. V.19]. ■ 

For several explicit models, we shall see that such compact invariant subsets indeed exist. 
On the other hand, once again due to the infinite-dimensional nature of the dynamical 
system, their identification and use for the various proofs is essential. 

5. Takahata's model. An early and now classic example was given by Takahata [24] . 
In our terminology, he used a recombinator based upon the transition probabilities 

T HM '■= k + £ +1 5 i+j-k+e i (I 4 ) 

for i, j, k, I € No- Observing card{(i, j) | i,j € No, i + j = k + 1} = k + I + 1, it is clear 
that T just describes a recombination with uniform distribution of the copy number pairs 
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(k, £) on each (finite) block of possibilities with k + £ fixed. One can also check, via Eq. ([7|) 
and Lemma [TJ that the mean m is preserved. On the basis of Eq. (fl4|) . the action of the 
recombinator from Eq. ((2]) on probability vectors p simplifies to 

*(p)< = E *T7TT^- (15) 

k,e>o 

k+l>i 

Though this model is mathematically rather transparent, it lacks a good intuitive justi- 
fication on the level of the biological processes. Nevertheless, its properties seem to be 
in acceptable agreement with at least some of the observations, compare [13j[3], though 
other results, as those shown in [9], indicate that also other types of equilibria appear in 
experiment. 

Proposition 4. The probability vector p defined by 

1 / \ n 
m + 1 \m + 1 / 

is a reversible equilibrium with mean m for the dynamics based on T of (|14|) . 

Proof. Using standard identities with geometric series and their derivatives, it is easy to 
check that p indeed defines a probability vector on No with mean m. Detailed balance 
follows from a simple calculation, 



„ _ 5i+ ,k+i ( 1 

hj,MPkPt - k + e + 1 \^Ti) {^Ti 



k+e 



i+j 



i + j + 1 \m+l J \m+l / 
thus completing the claim by means of Lemma [2] ■ 

These equilibria are geometric distributions as also discussed above in the Markov 
context. However, in view of some experimental findings reported in [9] and further 
arguments put forward in [23j . one would like to see an initial rise, and perhaps also a 
maximum in the vicinity of n ~ m. One should note that measurements often skip the 
entries for small copy numbers (which seem to be rather unreliable), so that a graph with 
a power law decay need not indicate the absence of some (weak) form of a maximum. 
As the methods for the further analysis of Takahata's model are similar to what we need 
later on for alternative models, we first continue to investigate Takahata's model. 

Theorem 1. If the initial condition, with mean m, satisfies a certain tightness condi- 
tion (limsup^^ yP]Jfi) < 1), the dynamics, both in discrete and in continuous time, 
converges to the equilibrium vector p from Proposition^ with lim^oo \\p(t) — p\\± = 0. 

The proof of this theorem, quite appropriately for the present context, uses an approach 
via generating functions and then relies on Banach's contraction principle. It requires 
several preparatory steps. 

Let a and S be fixed, with < a < 6 < oo, and consider the space 

X a ,s ■= {a = (a k ) keNo | a — 1 , a 1 — a , and < a k < 5 k for all k > 2} . (16) 
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If equipped with the metric 

dia.b) = J2d k \a k -b k \, (17) 

k>0 

where df. = ("f/5) k for some < 7 < |, the space X a .s is compact [19] Prop. 5]. 
Let us define a new vector, b(p), for suitable p, by 

h ^ ■= E(T) % ' ( 18 ) 

which is certainly well-defined for all p with limsupk^^ {/pTJ < 1, by an application 
of [IHl Prop. 6]. This proposition also clarifies the connection with the space X a s for 
suitable parameters a and S. As we shall see, X a ^ is an example of a compact, convex 
space that is invariant under the recombinator dynamics. It is easy to check that one has 
b(p) — 1 and b(p) 1 — m, so that we need X a ,s with a = m and 6 > m. 

Lemma 3. For any p with limsup^^ ^/p^ < 1, one has the convolution identity 

1 k 

b(K(p)) k = £ 6(p) m 6(p) fe _ m . 

m=0 

Proof. Let p be an arbitrary probability vector with limsup^^ ^/pTf < 1, so that the 
mapping b is well-defined. The left hand side leads to 



fi^> h. V / tf^t. V / ~ ,,-■>» n 



r + s + 1 

£>fc x 7 £>fc x ' r,s>0 

r+s>£ 

r+s 

[ k jPrPs 

r,s>0 ' ' l=k vv ' r,s>0 



PrPs fA 1 

^ r+s+l^\fc/ fc+1 ^ 



where a standard identity on binomial coefficients was used in the last step. 
On the other hand, one finds 

k 

E b (P)rnHP)k-m. = 
m=0 

m+n—k 

= E E Q (*) jvp. - E w 8 E Q (* 

m,n>0 r : s>0 v 7 v y r,s>0 m,n>0 x 7 x 

m-\-n—k r+s>fc r+s> A; m+n=fc 




E 

r,s>0 



r + s 



again using a standard identity, together with the fact that (^j = for m > n when n 
is an integer. A comparison of the two calculations establishes the claim. ■ 

The further relevance of Lemma [3] stems from the following property of the generating 
function of p, defined by ip(z) = ^2,^>QPt z ■ When rewritten as a Taylor series around 1 
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rather than around 0, one obtains 

^z) = e^ = E ( E (f) p<l (* - - E 5 (p) fe - !)*• ( 19 ) 

Under the assumptions on p, the radius of convergence of ip(z) is larger than 1, so 
that this calculation is on firm grounds. Lemma [3J now tells us that we may study the 
recombination action on the level of the expansion coefficients. 

Let us therefore define the induced recombination operator TZ on any space of type 
X a ,s, with 5 > a, by TZ(b(a)) — b(lZ(a)), which establishes a commuting diagram of the 
mappings TZ and TZ in the obvious way. More precisely, one first restricts the action of 
1Z to a suitable subspace of Mf, so that the mapping b is well-defined. If p satisfies the 
condition of Lemma [31 so that the radius of convergence of ij; exceeds 1, the probability 
vector p is also completely determined by its moments, compare Thm. II. 12. 7] to- 
gether with the observation that ^(e lt ) is the (convergent) moment generating function 
of p. As all moments, in turn, are specified by the entries of b(p), the latter uniquely 
determines p in this situation. 

It is easy to check that the vector (l,a,a 2 , . . .) is a fixed point of TZ in X a> s, for 
any 8 > a. Choosing a — m, this vector is the image of the probability vector p from 
Proposition [3] under the mapping b. 

Proposition 5. On X a ^, the map defined by TZ is a contraction. In particular, it is a 
globally Lipschitz continuous mapping of X a ^ into itself. 

Proof. Let 6 > a > be given, as well as arbitrary a,b <E X a> g. Clearly, we have TZ(a) Q = 1 
and TZ(a) 1 = a. For k > 2, one finds TZ(a) k = ^-j-j- X^=o a e a k-e — ^ '• This proves that 
TZ maps X a $ into itself. 

The space X a ^ is equipped with the metric d from (|17p . Since, due to b £ X a j, also 
TZ(b) = 1 and TZ(b) 1 = a, the contraction estimate reads as follows. 

d{TZ(a),TZ(b)) = ^ \^(a e a k _ e - b e b k _ e ) | = ^ -j^- |E( a <?- b t)( a k-t + b k-e) 

k>2 1=0 k>2 £=0 

k>2 1=2 t>2 k>t 1 

With the choice dk — (7/<5) fc , where we had 7 < |, we can now find, for I > 2, an upper 
bound for the inner sum, 

k>£ 1 k>e ' 

which, together with the previous calculation, proves the contraction property, 

d(TZ(a),TZ(b)) < Cd(a,b), 

with contraction constant C < 1. Clearly, this also means that TZ is globally Lipschitz 
continuous. ■ 



12 



MICHAEL BAAKE 



This shows that, in discrete time, we have exponentially fast convergence of the se- 
quence (lZ n (a)) n >x, with a £ X a ^, to a unique fixed point in X a ^. It is specified by the 
mean copy number m of the probability vector p that underlies a — b(p), via a — m, 
see above. Clearly, this fixed point (in X a .g) is the image (under b) of the equilibrium 
vector p £ Aif calculated earlier in Proposition |4j as the mapping b is invertible in this 
situation. The claim of Theorem [T] for discrete time is now clear, with exponentially fast 
convergence to the equilibrium, from any initial condition as specified there. 

For the slightly more involved treatment of the continuous time case, we refer to [19| . 
It is based on the construction of a Lyapunov function, similar to that of [191 Prop. 13]. 

6. Internal crossover. Another rather obvious model is based on the assumption that 
the shorter of the two sequences (or stretches) can align with any connected block of the 
longer sequence, but without any overhang. This situation has been coined internal un- 
equal crossover, or internal crossover for short. Here, restricting to probability measures 
on No, the recombinator ([2|) simplifies to 

E ttf^- (20) 

k,e>o 1 1 

kAe<i<kve 

where k A £ (k V €) stands for the minimum (the maximum) of k and £, see [331 HH1 EH] 
for details on this model. We choose the notation TZq for reasons that will become clear 
later on. 

In our search for fixed points, it is again useful to look for probability vectors that are 
reversible for 7£ - Since both forward and backward transition probabilities are simulta- 
neously non-zero only when {i, j} = {k,i} C {n,n + 1} for some n, the components p k 
may only be positive on this small set as well. By the following proposition, this indeed 
characterises all fixed points of this case. 

Proposition 6. A probability measure p £ M.~l is a fixed point of IZo if and only if 
its mean copy number m = J2k>o^Pk * s finite, together with Py m ^ = L m J + 1 — m » 
Pp m -| = m + 1 — [m], and p k = for all other k. This includes the case that m is a 
non-negative integer, where Py m ^ = P\ m -\ = P m = !• 

Proof. The 'if follows easily by insertion into (fl3jl and Lemma [2] For the 'only if part, 
let i denote the smallest integer such that p i > 0. Then, 

ft(p)i = Pi + 2 Pi Yl y+J = Pi \ Pi + Pi+1 + ^ ITT Pt+e - Pl ' 

e>i y e>2 J 

where the last step follows since j^- < \ j n the last sum. One has equality precisely when 
p k = for all k > i + 2. This implies m < oo and the uniqueness of p (given m) with the 
non-zero frequencies as claimed. ■ 

In this case, one may select a compact subset within the probability vectors by de- 
manding the existence of the centred r-th moment, for some fixed r > 1. More precisely, 
with 

A*«(P) := \ e ~ m \ S Pe> 
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one considers the set 

Mt im , c ■= {ptMt\ E k k Pk = m and Mr (p) < C} (21) 

for an arbitrary, but fixed C < oo, equipped with our usual metric as introduced before 
in |T]). This gives a compact and convex space [TH1 Lemma 2]. Moreover, one has 

Lemma 4. Let r > 1 be fixed and consider the space Ml mC of ([21]). Then, both /i 1 ana 
ji r satisfy 

with equality if and only if p is a fixed point of TZq . 

Moreover, /u, 1 : Al^ m c — ► R>o is continuous and defines a Lyapunov function for 
the dynamics in continuous time. 

Proof. To show the first claim, consider 



m s CMp)) = E E i + \k-i\ VkVl 

i>o k,e>o 1 1 



kr\l<i<k\ll 



1 kve 

k,e>o 1 1 i=kr\t 



(22) 



For notational convenience, let j = k + £ — i. We now show 

\i-m\ s + \k + £~i ~m\ s < \k - m\ s + \£ - m\ s . (23) 

If {k,£} — {i,j}, then (J22J) holds with equality. Otherwise, assume without loss of gen- 
erality that k < i < j < £. If m < k or m > £, we have equality for s — 1, but a strict 
inequality for s — r due to the convexity of x i— > x r . (For s = 1, this describes the fact 
that a recombination event between two sequences that are both longer or both shorter 
than the mean does not change their averaged distance to the mean copy number.) In the 
remaining cases, the inequality is strict as well. Hence, (i s (TZo(p)) < M s (p) wrtn equality 
if and only if p is a fixed point of TZq , since otherwise the sum in (1221) contains at least 
one term for which (|23[) holds as a strict inequality. 

To see that /i 1 is continuous, consider a convergent sequence (p^) in M-i m q and the 
random variables = \K^> — m|, where the 2fW are independent No-valued random 
variables with laws p( n \ Due to the structure of M^mCi ^ ne ran dom variables 
are uniformly integrable, which implies the convergence of the corresponding expectation 
values by Thm. 25.12]. This, in turn, is nothing but the continuity of Since ^\{p) 
is linear in p and thus infinitely differentiable, so is the solution p(t) for every initial 
condition p £ ■Mt m c , compare 1, Thm. 9.5 and Remark 9.6(b)]. Therefore, we have 

Mp ) = liminf ^Mtollgo) = ^(K (p )) - fiM < 0, 

again with equality if and only if p is a fixed point. Thus, fi 1 is a Lyapunov function as 
claimed. ■ 

Finally, this gives the following convergence result, the proof of which is given in [TU] 
and not repeated here. 
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Theorem 2. Assume that, for the initial condition p(Q) and fixed r > 1, the r-th moment 
exists, \i r (p) < oo . Then, m = is finite and, both in discrete and in continuous 

time, limt-xxj \\p(t) — p||i = with the appropriate fixed point p from Proposition^ m 
Let us mention that, for q = 0, the recombinator can be expressed in terms of explicit 
frequencies ir k .e of fragment pairs before concatenation (with copy numbers k and £) as 
T^o(p)i = S}=o Tjjt-J - ^ ^ s as an °P en question whether this can be used to simplify 
the above treatment. 



7. Random crossover. This model deviates from the previous one in that it admits 
arbitrary overhangs, up to the case where, after the crossover, one sequence got it all 
while the other lost everything. The possible alignments for any pair are supposed to be 
equally likely, so that the recombinator (|2|) , again restricted to the probability measures, 
now reads 

v 1 + min{fc, i, i, k + £ — i} 
(k + !){£+!) 



Ki[P)i = 2^ 7Z i ^^t^J 1 T\ PkPf ( 24 ) 



k,e>o 

k+l>i 



As for our previous two examples, using Lemma[2]once again, the reversibility condition, 

for all k + £ — i + j , 



/',, Pi Pi Pj 



fc+1^+1 

leads to an expression for fixed points. In fact, these relations have p k = C(k + l)x k as 
a solution, with appropriate parameter x and normalisation constant C. Again, it turns 
out that all fixed points are given this way, as was originally noticed (in a different way) 
in [23, Thm. A.2]. 

Proposition 7. Every fixed point p € of IZi has finite mean m = ^2 k kp k , and is 
uniquely specified by the value ofm. Explicitly, one has 

2 / 

m 



UT2J {k + 1 \ m 

with k £ Nq. m 

One can verify this in several ways, one being a direct calculation via induction. Interest- 
ingly, this equilibrium is the convolution of two geometric distributions (of equal mean 
m/2), and hence also of geometric type according to our terminology (which follows that 
of [23 ). It might be interesting to explore this observation a little further in the future. 

At this point, one can define, very much in analogy to the situation in Takahata's 
model above, an induced recombinator, TZx, acting once more on spaces of the form X a j. 
It is given as 

£i(p) = a(fti(p)) 
where the mapping a is defined by 



l>k 

It is thus closely related to our above mapping b. 

The main result on this model, proved in detail in [H)l [T3| . reads as follows. 
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Theorem 3. Assume that limsup^^ \Zpu (0) < 1. Then, both in discrete and in con- 
tinuous time, Hindoo \\p(t) —p\\i = 0, where p is the corresponding fixed point according 
to Proposition^ 

Proof. The proof is very similar to the one used above for the Takahata model, and 
employs once again Banach's contraction principle for the induced action of IZi on X a j. 
Since all details have been given in |19j . we omit them here. ■ 

The fixed points of Proposition [7] are of the expected geometric type, and are perhaps 
more realistic than those of the Takahata model, at least for cases where a maximum 
is present in the repeat distribution. However, one should note that the experimental 
situation is not completely convincing at present, so that it seems advantageous to have 
a versatile model class at hand. 

8. An interpolation. When considering the recombinators TZq and 7£i in comparison, 
one would like to find further models that share properties of both of them, or interpolate 
between them in a suitable way. In particular, is unrealistic due to the complete 
confinement of the shorter bit within the range of the longer one, while 1Z\ poses no 
restriction at all for any kind of overhang. One such interpolation was initially investigated 
in discrete time by Atteson and Shpak in [23] , based on preceding work by Ohta [17] and 
Walsh [26], see also [El [18] for more. The interpolation employs a penalty function idea 
for overhangs of the shorter sequence, and leads (in the above language) to a recombinator 
TZ q with < q < 1. The latter is based upon the transition probabilities 

4i = Cu <W+* (1 + nm{k, h j}) q ^-^) , (25) 
where k V £ :— max{fc,^}, k A I := min{fc,^}, and 0° = 1. The normalisation constants 
Cj$ are chosen such that ^ holds, i.e., J2i j>o ^Ifki = 1- These constants are symmetric 
in k and I and read explicitly 

U (k A £ + l)(|fc + 1)(1 - q) 2 + 2q(k A i - (k A £ + l)q + ' 

Note further that the total number of units is indeed conserved in each event and that 
the process is symmetric within both pairs. Hence ([7|) is satisfied. 

Unfortunately, the situation with the fixed points is a lot more complicated due to 
the following result. 

Proposition 8. For parameter values q S (0, 1), any fixed point p S Aif of the recom- 
binator lZ q , given by ([2]) and (|25p . satisfies p k > for all k > (unless it is the trivial 
fixed point p = (1, 0, 0, . . .) we excluded). None of these extra fixed points is reversible. 

Proof. Let a non-trivial fixed point p be given and choose any n > with p n > 0. Observe 
that n _ hnn > for < q < 1 and hence 

Pn±l = 'Rq(P)n±l = ^nll j.klPkPl ^ n-l,tm Pn Pn >0- 

j,k,l>0 

The first statement now follows by induction. For the second statement, evaluate the 
reversibility condition ([T3"]) for all combinations of i, j, k, I with i + j = k + £<4. This 
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leads to four independent equations. Three of them can be transformed to the recursion 

>« = w^hf/«->' kf - {2 - 3Ah 

from which one derives explicit equations for all p k with k S {2, 3, 4} in terms of p and 
Pi- Inserting the one for p 2 into the remaining equation yields another equation for p 4 
in terms of p and p 1 , which contradicts the first equation for all q € (0, 1), as is easily 
verified. ■ 

Nevertheless, the dynamics is well defined, and respects the compact subsets defined 
above in forward time, compare [TH1 Thm. 4]. Based upon the analysis in [151 119] . and 
further numerical work on the fixed points, it is plausible that, given the mean copy 
number m, never more than one fixed point for lZ q exists. Due to the global convergence 
results at q = and q = 1, any non- uniqueness in the vicinity of these parameter 
values could only come from a bifurcation, not from an independent source. Numerical 
investigations indicate that no bifurcation is present, but this needs to be analysed further. 

Moreover, the Lipschitz constant for the corresponding induced recombinator lZ q can 
be expected to be continuous in the parameter q, hence to remain strictly less than 1 
on the sets X a> s in the neighbourhood of q = 1. So, at least locally, the contraction 
property should be preserved. For further progress, it seems advantageous [IT] to use a 
rather different approach based on the analysis of similar problems in evolutionary game 
theory. Here, one would aim to establish a slightly weaker type of convergence result for 
all < q < 1, and probably even on the larger compact set -Mi m q from Eq. (f2Tj) . 

9. Open problems and outlook. The results for the various models presented here 
show that initial configurations, subject to some specific conditions that are no restric- 
tion in practice, converge to one of the known fixed points. These results apply to the 
deterministic dynamics of the infinite population limit . 

In view of the biological applications, one is also interested in possible deviations from 
this picture on the level of large, but finite, populations, i.e., for the underlying stochastic 
process, e.g., a variant of the Moran model with unequal crossover. In this model class, 
however, important deviations seem unlikely, due to the known convergence results for 
the infinite population limit, see [2] for more. 

Since the above equilibrium distributions have finite support or are exponentially 
small for large copy numbers, one can also expect these systems to behave very much 
like ones with only finitely many types. In this sense, the results are typical, and the 
more general setting with probability vectors on No is adequate. This is also supported 
by several simulations [TSJ. 

Still, an open question is a more complete understanding of the regime q € (0, 1) 
in Section 8. Due to the loss of reversibility of the fixed points, the analysis becomes 
rather involved. Preliminary investigations [18] have not given any hint on values of q 
where convergence fails or where alternative stable fixed points show up, though this is 
presently only based on numerical experiments and perturbative arguments. It might be 
advantageous (and perhaps also more realistic) to search for other ways to interpolate 
between the cases q = and q — 1, preferably ones that maintain the reversibility of the 
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equilibria. This question certainly deserves further attention. 
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